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Abstract: We compute explicit bounds in the Gaussian approximation of functionals of infinite Rade- 
i macher sequences. Our tools involve Stein's method, as well as the use of appropriate discrete Malliavin 
, operators. Although our approach does not require the classical use of exchangeable pairs, we employ a 
' chaos expansion in order to construct an explicit exchangeable pair vector for any random variable which 
■ depends on a finite set of Rademacher variables. Among several examples, which include random variables 
which depend on infinitely many Rademacher variables, we provide three main appHcations: (i) to CLTs 
^4 I for multilinear forms belonging to a fixed chaos, (ii) to the Gaussian approximation of weighted infinite 
^ ' 2-runs, and (iii) to the computation of expHcit bounds in CLTs for multiple integrals over sparse sets. This 
. last application provides an alternate proof (and several refinements) of a recent result by Blei and Janson. 
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1 Introduction 

^ I The connection between Stein's method (see e.g. [71 [Ml EEl [39]) and the integration by parts 
Q ' formulae of stochastic analysis has been the object of a recent and quite fruitful study. The 
papers [22l[23l[2ll deal with Stein's method and Malliavin calculus (see e.g. [26]) in a Gaussian 
[ setting; in |25| one can find extensions involving density estimates and concentration inequalities; 
^ [ the paper [28] is devoted to explicit bounds, obtained by combining Malliavin calculus and 
Stein's method in the framework of Poisson measures. Note that all these references contain 
OO , examples and applications that were previously outside the scope of Stein's method, as they 
involve functionals of infinite-dimensional random fields for which the usual Stein-type techniques 
(such as exchangeable pairs, zero-bias transforms or size-bias couplings), which involve picking 
^ , a random index from a Gnite set, seem inappropriate. 

^ I The aim of this paper is to push this line of research one step further, by combining Stein's 

method with an appropriate discrete version of Malliavin calculus, in order to study the normal 
approximation of the functionals of an infinite Rademacher sequence. By this expression we 
simply mean a sequence X = {Xn : n ^ 1} of i.i.d. random variables, such that P{Xi = 1) = 
P{Xi = —1) = 1/2. A full treatment of the discrete version of Malliavin calculus adopted in this 
paper can be found in [3l] or [32]; Section [231 below provides a short introduction to the main 
objects and results. 
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The main features of our approach are the following: 

(i) We will be able to deal directly with the normal approximation of random variables possibly 
depending on the whole infinite sequence X. In particular, our results apply to random 
elements not necessarily having the form of partial sums. 

(ii) We will obtain bounds expressed in terms of the kernels appearing in the chaotic decom- 
position of a given square-integrable random variable. Note that every square-integrable 
functional of X admits a chaotic decomposition into an orthogonal sum of multiple inte- 
grals. 

(iii) We employ the chaotic expansion in order to construct an explicit exchangeable pair vec- 
tor for any random variable which depends on a finite set of Rademacher variables. In 
particular, this exchangeable pair satisfies a linearity condition in conditional expectation 
exactly, which allows the normal approximation results from Reinert and Rollin [34| to be 
applied. 

(iv) In the particular case of random variables belonging to a fixed chaos, that is, having the 
form of a (possibly infinite) series of multilinear forms of a fixed order, we will express our 
bounds in terms of norms of contraction operators (see Section [2^ . These objects also 
play a central role in the normal and Gamma approximations of functionals of Gaussian 
fields (see [22l EH |2ll EZ]) and Poisson measures (see [28l [29l |30]). In this paper, we 
shall consider kernels defined on discrete sets, and the corresponding contraction norms 
are integrals with respect to appropriate tensor products of the counting measure. 

The use of contraction operators in the normal approximation of Rademacher functionals 
seems to us a truly promising direction for further research. One striking application of these 
techniques is given in Section [6l where we deduce explicit bounds (as well as a new proof) for a 
combinatorial central limit theorem (CLT) recently proved by Blei and Janson in [1], involving 
sequences of multiple integrals over finite "sparse" sets. In the original Blei and Janson's proof, the 
required sparseness condition emerges as an efficient way of re-expressing moment computations 
related to martingale CLTs. In contrast, our approach does not involve martingales or moments 
and (quite surprisingly) the correct sparseness condition stems naturally from the definition of 
contraction. This yields moreover an explicit upper bound on an adequate distance between 
probability measures. 

Another related work is the paper by Mossel et al. [21], where the authors prove a general 
invariance principle, allowing to compare the distribution of polynomial forms constructed from 
different collections of i.i.d. random variables. In particular, in [21] a crucial role is played 
by kernels "with low influences": we will point out in Section [4] that influence indices are an 
important component of our bounds for the normal approximation of elements of a fixed chaos. 
In Section [Ql we also show that the combination of the findings of the present paper with those 
of [3] yields an elegant answer to a question left open by Blei and Janson in [^. 

A further important point is that our results do not hinge at all on the fact that N is an 
ordered set. This is one of the advantages both of Stein's method and of the Malliavin-type, 
filtration-free stochastic analysis. It follows that our findings can be directly applied to i.i.d. 
Rademacher families indexed by an arbitrary discrete set. 

The reader is referred to Chatterjee [5] for another Stein-type approach to the normal ap- 
proximation of functionals of finite i.i.d. vectors. Some connections between our findings and 
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the results proved in [5] (in the special case of quadratic functionals) are pointed out in Section 

The paper is organized as follows. Section [2] contains results on multiple integrals, Stein's 
method and discrete Malliavin calculus (in particular, a new chain rule is proved). Section [3] 
provides general bounds for the normal approximation of Rademacher functionals; it gives first 
examples and the exchangeable pair construction. In Section [3] we are interested in random 
variables belonging to a fixed chaos. Section [5] focuses on sums of single and double integrals and 
related examples. Section [6] is devoted to further applications and refinements, mainly involving 
integrals over sparse sets. Technical proofs are deferred to the Appendix. 

2 Framework and main tools 
2.1 The setup 

As before, we shall denote by X = {X„ : n ^ 1} a standard Rademacher sequence. This means 
that X is a collection of i.i.d. random variables, defined on a probability space (fi, J^, P) and 
such that P [Xi = 1] = P [Xi = —1] = 1/2. To simplify the subsequent discussion, for the rest of 
the paper we shall suppose that Q = { — 1, 1}^ and P = [^{S-i + di}]^ . Also, X will be defined 
as the canonical process, that is: for every uj = {uji,uj2, ■■■) G Xn{co) = ojn- Starting from X, 
for every N ^ 1 one can build a random signed measure ^X[x,N) on {1, A^}, defined for every 
A C {!,... ,A^} as follows: 

^{x,N) {A) = ^Xj. 

It is clear that = (y{^Ji[x,N) : A^ ^ !}• We shall sometimes omit the index A^ and write 

t^{x,N) = ^^x^ whenever the domain of integration is clear from the context. 

We define the set D = {{11,12) G : ii = to be the diagonal of N^. For n ^ 2, we put 

A„ = {(ii, in) e N" : the i/s are all different} , (2.1) 

and, for A^, n ^ 2, 

A^ = A„n{l,...,Arr (2.2) 

(so that = if A^ < n). The random signed measures fJ.(x,N) satisfy that, for every 
A,Bc{l,...,N}, 

fifl^^ {[AxB]nD)=Y^ =K{AnB) = Uj-j^AnB}, (2.3) 

j 

where k is the counting measure on N. The application A 1— > M^at) ([^ x ^] H Z?) is called 
the diagonal measure associated with fJ-(^x,N)- The above discussion simply says that, for 
every A^ ^ 1, the diagonal measure associated with H(^x,N) is the restriction on {1, A^} of the 
counting measure k. Note that, for finite sets A, B one has also 

E[f,xiA)fixiB)]=KiAnB). 
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Remark 2.1 In the terminology e.g. of [30] and [40], the collection of random variables 

{fi^x,NM)-Ac{l,...,N},N;^l} 

defines an independently scattered random measure (also called a completely random measure) 
on N, with control measure equal to k. In particular, if Ai,...,^^ is a collection of mutually 
disjoint finite subsets of N, then the random variables //^(^i), iixi^k) are mutually indepen- 
dent. 

2.2 The star notation 

For n ^ 1, we denote by £^ (N)" the class of the kernels (= functions) on N" that are square 
integrable with respect to k®"; l'^ (N)"" is the subset of £^ (N)" composed of symmetric kernels; 

(N)^ is the subset of £^ (N)" composed of kernels vanishing on diagonals, that is, vanishing on 
the complement of A„ ; £q (N)"*^ is the subset of £q (N)"" composed of symmetric kernels. 

For every n,7n ^ 1, every r = 0, ...,n A ?n, every I = 0, ...,r, and every / G (N)°" and 
g & io (N)"*", we denote hy f -k^ g the function of n + m — r — / variables obtained as follows: 
r variables are identified and, among these, I are integrated out with respect to the counting 
measure k. Explicitly, 

/ *r 5 (^1 ) •• •) "^n+m—r—l ) 

— ^ ^ / (il ) •• •) in— r J in— r+1 ) •• •) in— Z ) C^l ) • •• ) O!:;) 
ai,...,a; 

^ 9 (^n— Z+1) •••) in+m—r—l'i in—r+li •••i '^n—l'i Q-lj •••) (^l) j 

note that, since / and g vanish on diagonals by assumption, the sum actually runs over all vectors 
(ai,...,a/) G A/. For instance, 

f*0 9 ik, ...,in+m) = f ®9 (il, ...,in+m) = / («!, . .. , in ) 5 ( Wl ) •••,in+m) , 

and, for r G {1, . . . , n A m}, 

/ *r 5 (^1) Wm-2r) 

— ^ ^ / (il 5 • •• 1 in— r ) fll ) •• •) '^r ) 5 (^n— r+1 ) •• •! in+m— 2r ) Ol , .. ., Or ) . 

In particular, \{ r = m = n 

f*m9= ^ /(ai,---,am)5'(ai)"->«m) = (/,5)^2(p^)»m. 

ai,...,a„i 

Example 2.2 If n = ?7t, = 2, one has 

f-*^o9{ii,i2,i3,k) = f {ii,i2)g{i3,H) ; / 5 (n, «2, is) = f {h^h) 9 {k^k) ; 
/*i 5'(n,i2) = Sa/(ii,a) c/(i2,a) ; / *2 5 (n, ^2) = / (ii, i2) 5 (n, ^2) ; 
/ *2 5 (*) = ^a/ (*> «) 5 a) ; f *l g = T^a^aif (^1,02) g (ai, 0^2) • 

Remark 2.3 In general, a kernel of the type f -^l g may be not symmetric and may not vanish 
on diagonal sets. 
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The next two lemmas collect some useful facts for subsequent sections. Their proofs are 
postponed in the appendix. 

Lemma 2.4 Let f £ (N)°'', g £ (N)°™ (n, m ^ 1) and ^ / ^ r ^ n A m. Then: 



1. /4<7G^MNf("+"^-^-') an^f||/45ll,2(j,)«(n+™-.-o ^ 

2. ifn^2 then 



max 

j&N 



(6i,...,b„_i)eN"-i 



and, if I = 1, 



,n, 



\9\ 



£2(N) 



|2™^«nXmax Y] f{j,bi,...,bn-i) 

(6i,...,fe„_i)6N"-i 



11/ *i 5'll£2(p^)®(n + »n-2I + l) — 1 1 / (j ) " ) */_ 1 5 0) " ) 1 1 ^2 (p^) ® (" + m-20 



(2.4) 



5- /ll^2(N)«{2n-i) = II/*" Vll£2(N) and, for every I = 2,..., n {n^ 2), 

\\f *\ ^ /||£2(N)®(2n-2i+l) ^ ll/*|_l / X 1a=(^_;^jj ||£2(p^)«(2n-2i+2) ^ ll/*|_l / 1 1 £2 (N)® (2n-2i+2) . 

Here, stands for the complement of the set AJ^, that is, A^ is the collection of all vectors 
{ii, ...jiq) such that = ii for at least one pair {l,k) such that I 7^ k. 

Remark 2.5 1. According e.g. to [21], the quantity 

f\j,bl,...,bn-l) 
(fei,...,b„_i)GN"-i 

is called the influence of the j^^ coordinate on /. 

2. When specializing Lemma [2^ to the case n = 2, one gets 

2 

^ 11/*^ /II? 



max 



.ieN 



(2.5) 



Afx 1a-||?2(n)»2 ^ 11/ *1 /||?2(N)»2 = Trace{[/]^}. 



Here, [/] denote the infinite array {f{i,j) ■ i,j£ N} and [f]^ stands for the fourth power 
of [/] (regarded as the kernel of a Hilbert-Schmidt operator). 

Lemma 2.6 Fix n,m ^ 1, and let fk G £o(N)°" and gu G ^l{K)°"' , k ^ 1, be such that 
fk — > / and gk — > g, respectively, in ^q(N)°" and ^g(N)°'". Then, for every r = 0, ...,n A m, 

fkK9k fK9 m£2(N)®-+"-2'-. 
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2.3 Multiple integrals, chaos and product formulae 

Following e.g. [32], for every q ^ 1 and every / G l^)"*^ denote by Jg{f) the multiple 
integral (of order q) of / with respect to X. Explicitly, 

Mf) = E f{ii,-,h)Xi,---Xi^= Yl f{k,...,iq)Xi,---X,^ (2.6) 

(ii,...,iq)eNi (n,...,ig)eA9 

= 9' X] fih,-,iq)Xi-, - ■ ■ Xi^, 

ii<...<iq 

where the possibly infinite sum converges in L^(r2). Note that, if / has support contained in some 
finite set {1, A^}'', then Jq{f) is a true integral with respect to the product signed measure 
l^^x,N)^ that is. 



J<iif)= [ fdl^'l'lj,)= [ 

Jn,-,N}i ^ ' ' Jan 



.,N}i " ' JA^ ' 

One customarily sets {Kf^ = M, and Jo(c) = c, Vc G M. It is known (see again [32]) that, if 
/ S ^0 (N)°'' and 5 S (^)°^) then one has the isometric relation 

E[Jq{f)Jp{g)] = l|g=p|g!(/,5r)^2(N)»9. (2.7) 

In particular, 

E[Mff]=ql\\ftH^)^q. (2.8) 

The collection of all random variables of the type Jn{f), where / S is called the gth 

chaos associated with X. 

Remark 2.7 Popular names for random variables of the type Jq{f) are Walsh chaos (see e.g. 
[ini Ch. IV]) and Rademacher chaos (see e.g. [U [TH IE])- In Meyer's monograph [20] the 
collection {Jg{f) : / S (N)"" , g ^ 0} is called the toy Fock space associated with X. 

Recall (see [3^) that one has the following chaotic decomposition: for every F G L^{a{X}) 
(that is, for every square integrable functional of the sequence X) there exists a unique sequence 
of kernels /„ € (N)°", n ^ 1, such that 

F = E{F)+Y,'Jn{fn)=E{F) + Y,n\ Uk, .■.,in)Xi, ■ ■ ■ X,^, (2.9) 

n>l n^l ii<i2<...<in 

where the second equality follows from the definition of Jn{fn), and the series converge in L^. 
Remark 2.8 Relation p. 91) is equivalent to the statement that the set 

{l}u(j{X,,...Xi„: li^k< ■■■ < in} (2.10) 

is an orthonormal basis of L'^{a{X}). An immediate proof of this fact can be deduced from basic 
harmonic analysis. Indeed, it is well-known that the set = {— 1, +1}^, when endowed with the 
product structure of coordinate multiplication, is a compact Abelian group (known as the Cantor 
group), whose unique normalized Haar measure is the law of X. Relation (12. 9p then follows from 
the fact that the dual of O consists exactly in the mappings uj 1 and lo i— > Xi-^{uj) ■ • • Xj^(a;), 
in > ... > ii ^ 1, n ^ 1. See e.g. [SI Section VII. 2]. 
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Given a kernel / on N", we denote by / its canonical symmetrization, that is: 



f {il,...,in) = ^y^/(V(l),-.-,V(n)j. 



where a runs over the n\ permutations of {l,...,n}. The following result is a standard multi- 
plication formula between multiple integrals of possibly different orders. Since we did not find 
this result in the literature (see, however, [32l formula (15)]), we provide a simple combinatorial 
proof in Section | 



Proposition 2.9 For every n,m^ 1, every f € (N)°" and g e Iq (N)"™", one has that 

nAm / \ / \ 

Jn if) Jm {g) = J2rl l^^j l^j Jn+m-2r [(Z^) 1a„+„_2.] • (2.11) 

Remark 2.10 Proposition 12.91 yields that the product of two multiple integrals is a linear com- 
bination of square-integrable random variables. By induction, this implies in particular that, for 
every / € (N)°" and every ^ 1, E\Jn{f)\'' < oo, that is, the elements belonging to a given 
chaos have finite moments of all orders. This fact can be alternatively deduced from standard 
hypercontractive inequalities, see e.g. Theorem 3.2.1 in [H] or [HI Ch. 6]. Note that similar 
results hold for random variables belonging to a fixed Wiener chaos of a Gaussian field (see e.g. 
PI Ch. V]). 



2.4 Finding a chaotic decomposition 

The second equality in (12. 9p clearly implies that, for every ii < ... < 

n\fnik,--,in) = E{F x Xi^ ■ ■ ■ XiJ. 

In what follows, we will point out two alternative procedures yielding an explicit expression for 
the kernels /„. 

(i) Mobius inversion (Hoeffding decompositions). We start with the following observation: if 
F = F[Xi., ...,Xrf) is a random variable that depends uniquely on the first d coordinates of the 
Rademacher sequence X, then necessarily F admits a finite chaotic decomposition of the form 

d 

F = E{F)+Y, nlUii,...,in)Xi,---X,^. (2.12) 

n=l l^ii<...<in^d 

Note that the sum in the chaotic decomposition (|2.12p must stop at d: indeed, the terms of 
order greater than d are equal to zero since, if not, they would involve e.g. products of the 
type Xj^ ■ ■ ■ Xj^^-^ with all the jaS different, which would not be consistent with the fact 
that F is a{Xi,... , Xf^j-measurable. Now note that one can rewrite p.l2p as F = E{F) + 
E/c{i,...,4 where, for I := {n, . . 

G{I)=a\fa{il,...,ia)Xi,...Xi^. 
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By exploiting the independence, one obtains also that, for every J := {ji, jn} C {1, . . . , d}, 

H{J) := E[F - E{F)\X,, , . . . , = J] 

icJ 

thus yielding that, by inversion (see e.g. (371 P- US]), for every n = 1, 

n\Uh,-:jn)X,,---Xj^= Yl {-IT~''E[F-E{F)\X,,,...,X,^]. (2.13) 

{il,..;ia}C{jl,...,jn} 

Formula (|2.13|1 simply means that, for a fixed n, the sum of all random variables of the type 
nlfn{ji, ...,jn)Xj^ ■ ■ ■ Xj^ coincides with the n}^ term in the Hoeifding-ANOVA decomposition 
of F (see e.g. [E]). By a density argument (which is left to the reader), representation (|2.13ll 
extends indeed to random variables depending on the whole sequence X. 

(ii) Indicators expansions. Assume that we can write F = F{Xi, . . . , Xd) with F : {—1, +1}'^ —>■ 
R. Taking into account all the possibilities, we have 

d 

F= E F(ei,...,ed)ni{x,=.J- 

{ei,...,ea)&{-l,+lY ^=1 

Now, the crucial point is the identity l{Xi=e,} = |(1 + £iXi). Hence 

d 

F = 2-^ F{ei,...,ed)J{{l+eiXi). (2.14) 

But 

d 

\{{l+eiXi) = 1+ Y 

1=1 l^ii^cZ li^i\<i2i^d 

-\- ^ ^ £«! £^12 S^is^ii ^42^*3 ~l~ • • • ~l~ • • • ^i^-^ii ■ ■ ■ -^idi 

l^n<«2<«3^d 

inserting this in (|2.14j) one can deduce the chaotic expansion of F. 



2.5 Discrete Malliavin calculus and a new chain rule 

We will now define a set of discrete operators which are the analogues of the classic Gaussian- 
based Malliavin operators (see e.g. [HI [26]). The reader is referred to [3l] and [32] for any 
unexplained notion and/or result. 

The operator -D, called the gradient operator, transforms random variables into random 
sequences. Its domain, noted domD, is given by the class of random variables F S L?'{a{X}) 
such that the kernels /„ G ^g(N)°" in the chaotic expansion F = E{F) + Xl^^i Jn{fn) (see 
verify the relation 

nn!||/„,||^2(p^-)®n < oo. 
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In particular, if F = depends uniquely on the first d coordinates of X, then 

F S domD. More precisely, D is an operator with values in x N, P k), such that, for 

every F = E{F) + ^(/n) £ domZ), 

AF = ^nJ„_i(/„(-,A:)), A: ^ 1, (2.15) 

where the symbol fni-,k) indicates that the integration is performed with respect to n — 1 
variables. According e.g. to [3ll[32], the gradient operator admits the following representation. 
Let uj = {uji,uj2, • • •) G il, and set 

and 

UJ- = {uJl,UJ2, ■ ■ ■ , -1, Wfc+l, . . .) 

to be the sequences obtained by replacing the k^^ coordinate of w, respectively, with +1 and — 1. 
Write F^ instead of -F(w^) for simplicity. Then, for every F S doniL', 

DfeFH = ^(F+-F,-), k^l. (2.16) 



Remark 2.11 It is easily seen that, if the random variable F £ L'^{a{X}) is such that the 
mapping {to, k) i— > ^(F^ — F^)(ti;) is an element of L^(r2 x N, P(>^k), then necessarily F £ domD. 

We write 6 for the adjoint of D, also called the divergence operator. The domain of 6 is 
denoted by domJ, and is such that domJ C L^(0 x N, P k). Recall that 6 is defined via the 
following integration by parts formula: for every F £ domD and every u G domJ 

E[F6{u)] = E[{DF, = {DF, n)i2(f,^N,p^,). (2.17) 

Now set -Lq((7{X}) to be the subspace of L^{a{X}) composed of centered random variables. 
We write L : L'^{a{X}) — > Lq((7{X}) for the Ornstein-Uhlenbeck operator, which is defined as 
follows. The domain domL of L is composed of random variables F = E{F) + X]„>i Jnifn) G 
L'^{a{X}) such that 

^n^n!||/„||^2(p^)®n < oo, 

and, for F £ domL, 

LF = Jnifn). (2.18) 

With respect to [3T1[32], note that we choose to add a minus in the right-hand side of (|2.18|) . in 
order to facilitate the connection with the paper [22]. One crucial relation between the operators 
(5, D and L is that 

5D = -L. (2.19) 
The inverse of L, noted L~^, is defined on F G Lq(cj{X}), and is given by 

F"^F = - V-J„(/„). (2.20) 



n>l 



n 
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Lemma 2.12 Let F £ domD be centered, and / : M ^ M 6e such that f{F) £ domD. Then 
E[Ff{F)] =E[{Df{F),-DL-'F)e2(^)]. 

Proof. Using (|2.19p and ()2.17|) consecutively, we can write 

E[Ff{F)] = E[LL-'FfiF)] = -E[6DL-'Ff{F)] = E[{Df{F),-DL-'F),2^^)]. 

■ 

Finally, we define {Pt : t ^ 0} = {e*^ : i ^ 0} to be the the semi-group associated with L, 
that is, 

oo oo 

PtF = ^e-'^*J„(/„), t^O, for F = E{F) + ^Jn{fn)eL^{a{X}). (2.21) 

n=0 n=l 

The next result will be useful throughout the paper. 
Lemma 2.13 Let F £ domD and fix k £N. Then: 

1. The random variables DkF, D^L^^F, F^ and F^ are independent of X^. 

2. It holds that |F+ - F| s: 2\DkF\ and \F^ - F| ^ 2\DkF\, P-almost surely. 

3. If F has zero mean, then E\\DL~^F\\^p^j^^ ^ iiJ||DF||^2(i^) with equality if and only if F is 
an element of the first chaos. 

Proof. 1. One only needs to combine the definition of F^ with (|2.16p . 

2. Use F^-F = ±{F+ - F^)l{x,=^,y = ±2DkF^x,=^,y 

3. Let us consider the chaotic expansion of F: 



F = Y,Jnifn). 



Then -DkL ^F = X]n>i Jn~i{fn{-, k)) and DkF = J2n^i nJn^i{fn{-, k)). Therefore, using the 
isometric relation (12. 



E\\DL-'F\\%^^^ = E^i^Jn-i{fn{;k)) 

fcGN \7i^l 

= y^(n — l)!||/n||^2(f^-)®n 

^n'^in- l)l\\fn\\%^j^)^r. 



I I =E\\DF^^^ 



Moreover, the previous equality shows that we have equality if and only if = for all n ^ 2, 
that is, if and only if F is an element of the first chaos. ■ 
We conclude this section by proving a chain rule involving deterministic functions of random 
variables in the domain of -D. It should be compared with the classic chain rule of the Gaussian- 
based Malliavin calculus (see e.g. [2Sl Prop. 1.2.2]). 
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Proposition 2.14 (Chain Rule). Let F £ domD and / : R — > R 6e thrice differentiable with 
bounded third derivative. Assume moreover that f{F) £ domD. Then, for any integer k, P-a.s.: 

Duf{F) - f'{F)D,F + l{f"{F+) + f"{F-)){D,FfX, 

Proof. By a standard Taylor expansion, 

DkfiF) = \{f{F^)-f{F;)) = \{f{F^)-f{F))-\{f{F-)-f{F)) 
= ^f'iF){F+-F) + \f"{F){F+-Ff+R, 

= f'{F)DkF + 

lifiF^) + /"(F,:)) ((F+ - Ff - (F,- - Ff) +R,+R, + R, 

= f'{F)D,F - \{f"{F+) + nF-)){D,FfXk + Ri + R2 + R3, 
where, using Lemma r2. 131 

\Ri\ ^ ^\noo\F+-Ff^^\f"'UDkF\^ 

\R2\ ^ ^\f"\oo\Pk-Ff^l\noo\DkFf 

m ^ ^irioo - Ff + |F,7 - F|') ^ 2\f"'UDkFf. 

By putting these three inequalities together, the desired conclusion follows. ■ 

2.6 Stein's method for normal approximation 

Stein's method is a collection of probabilistic techniques, using differential operators in order to 
assess quantities of the type 

\E[h{F)] - E[h{Z)]\, 

where Z and F are generic random variables, and the function h is such that the two expectations 
are well defined. In the specific case where Z ~ ^{0, 1), with =/K(0, 1) a standard Gaussian law, 
one is led to consider the so-called Stein equation associated with h, which is classically given by 

hix)-E[h{Z)] = f'{x)-xf{x), xGR. (2.22) 

A solution to (|2.22l) is a function /, depending on h, which is Lebesgue a.e. -differentiable, and 
such that there exists a version of /' verifying (|2.22|) for every a; € R. The following result collects 
findings by Stein [38l[39], Barbour [2], Daly [8] and Gotze [13]. Precisely, the proof of Point (i) 
(which is known as Stein's lemma) involves a standard use of the Fubini theorem (see e.g. [3, 
Lemma 2.1]). Point (ii) is proved e.g. in [MJ Lemma IL3] (for the estimates on the first and 
second derivative), in [H Theorem 1.1] (for the estimate on the third derivative) and in [2] and 
[T3] (for the alternative estimate on the first derivative). From here onwards, denote by the 
set of all real-valued bounded functions with bounded derivatives up to k^^ order. 



5^ —\f'"\^\DkF 
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Lemma 2.15 (i) Let W be a random variable. Then, W = Z ^ ^(0, 1) «/ and only if for 
every continuous, piecewise continuously differentiable function f such that'E\f'{Z)\ < oo, 

Wi.W) - Wf{W)] = 0. (2.23) 

(ii) Ifh G C^, then 112. 2S]) has a solution fh which is thrice differentiable and such that ||/^||oo ^ 
4||/i||oo, ||/;,'||oo ^ 2||/i'||oo and ||/;,"||oo ^ 2\\h"\\oo. We also have \\f'J ^ \\h"\\oc. 

Now fix a function h £ C^, consider a Gaussian random variable Z ~ ^(0, 1), and let F be 
a generic square integrable random variable. Integrating both sides of (|2.22|) with respect to the 
law of F gives 

\E[h{F)] - E[h{Z)]\ = \E[f'^XF)-FfH{F)]\ (2.24) 

with fh the function given in Lemma [2.151 In the following sections we will show that, if F is 
a functional of the infinite Rademacher sequence X, then the quantity in the RHS of (|2.24p can 
be successfully assessed by means of discrete Malliavin operators. 

Remark 2.16 Plainly, if the sequence F„, n ^ 1, is such that |ii^[/i(F„)] — £'[/i(Z)]| — > for 
every /i G C^, then F„ Z . 



3 General Bounds for Rademacher functionals 
3.1 Main bound 

The following result combines Proposition 12.141 with Lemma r2.151 (ii) and Relation (|2.24|) in order 
to estimate expressions of the type |ii^[/i(F)] — £'[/i(Z)]|, where F is a square integrable functional 
of the infinite Rademacher sequence X, and Z ~ ^(0, 1). 

Theorem 3.1 Let F G domD be centered and such that E\DkF\^ < oo. Consider a function 
h G and let Z ~ ^(0, 1). Then, 

\E[h{F)] - E[hiZ)]\ ^ min(4||/i|U, \\h"\\oo)Bi + \\h"\\^B2, (3.25) 

where 



Bi = E\l- {DF,-DL-'F)e2^^)\ ^ J E[{1 - {DF, -DL~^F),2^n))^] 



Bo — — E 
3 



\DL~'F\,\DF\')^,^^^ 



(3.26) 



Proof. Since h £ C^, Equality (|2.24|) holds. Observe that, since the first two derivatives of fh 
are bounded, one has that fh{F) G domD (this fact can be proved by using a Taylor expansion, 
as well as the assumptions on DF and the content of Remark l2.1ip . Using Lemma [2.121 one 
deduces that 

E[f',{F) - Ffh{F)] = E[f'^{F) - {Dfh{F),-DL-^F)p^n)\. 

We now use again the fact that is bounded: as an application of the first point of Lemma 
12.131 we deduce therefore that 

E [DkL-^F X {f'^iF+) + f';{F~)){DkFfXu\ =0, k^l; 
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in particular, the boundedness of /^', along with the fact that E\DkF\'^ < oo and Lemma [2.131 
(3), ensure that the previous expectation is well-defined. Finally, the desired conclusion follows 
from the chain rule proved in Proposition 12.141 as well as the bounds on ||/^||oo and ||//^"||oo 
stated in Lemma r2.151 (ii). ■ 

Remark 3.2 1. The requirement that < oo is automatically fulfilled whenever 

F belongs to a finite sum of chaoses. This can be deduced from the hypercontractive 
inequalities stated e.g. in [181 Ch. 6]. 

2. Since we are considering random variables possibly depending on the whole sequence X 
and having an infinite chaotic expansion, the expectation in (|3.26p may actually be infinite. 

3. Fix q ^ 1 and let F have the form of a multiple integral of the type F = Jq{f), where 
/ G £l{N)°i. Then, one has that 

-Z)L-iF),2(N) = ^\\DF\\l^^^, (3.27) 

{\DL-^F\, |Z)F|3),2(N) = ^\\DF\\j,^^y (3.28) 

4. Let G be an isonormal Gaussian process (see [16] or [26]) over some separable Hilbert space 
Sj, and assume that F G L'^{a{G}) is centered and differentiable in the sense of Malliavin. 
Then the Malliavin derivative of F, noted DF, is a ^-valued random element, and in [22] 
the following upper bound is established (via Stein's method) between the laws of F and 
Z~^(0,1): 

dTv{F, Z) i^2E\l- {DF, -DL~^F)^l 

where is here the inverse of the Ornstein-Uhlenbeck generator associated with G and 
dTV is the total variation distance between the laws of F and Z . 

5. Let be a compensated Poisson measure over some measurable space {A, A), with cj-finite 
control measure given by /i. Assume that F G L?'{a{N}) is centered and differentiable in 
the sense of Malliavin. Then the derivative of F is a random processes which a.s. belongs 
to i^(/i). In [2H Section 3] the following upper bound for the Wasserstein distance (see 
Section [3^ ) between the laws of F and Z ~ -yViO, 1) is proved (again by means of Stein's 
method): 

dw{F,Z) ^ E\l - {DF,-DL-^F)l2^,,)\+E I \DaF\^ x \DaL-^ F\f,{da). 

J A 



3.2 First examples: Rademacher averages 

A (possibly infinite) Rademacher average is just an element of the first chaos associated with X, 
that is, a random variable of the type 

oo 

F = Y^ aiXi, with a € (3.29) 

i=l 

See e.g. [HI Ch. IV] for general facts concerning random variables of the type (|3.29|) . The next 
result is a consequence of Theorem 13.11 and of the relations (|3.27l) - (|3.28l) . It yields a simple and 
explicit upper bound for the Gaussian approximation of Rademacher averages. 
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Corollary 3.3 Let F have the form 113. 29]) . let Z ~ ^{0, 1) and consider h£C'^. Then 



\E[h{F)] - E[h{Z)]\ ^ min(4||/i|U, \\h"\ 



1-E 



i=l 



+ ^11^ lloo l^Oi 
i=l 



(3.30) 



The proof of Corollary 13.31 (whose details are left to the reader) is easily deduced from the 
fact that, for F as in ([a^QD . one has that (l3^ - ([3?28D with q = 1 hold, and DiF = Ui for all 
i ^ 1. We now describe two applications of Corollary 13.31 The first one provides a faster than 
Berry-Esseen rate for partial sums of Rademacher random variables, using smooth test functions. 

Example 3.4 Let F„, n ^ 1, be given by 

1 " 

Fn = — / ^Xj. 



Then, Relation (|3.3QI) in the special case = 1 

\E\h{F)\-E\h{Z)\\ ^ 



-1/2 



{j^n} ^ yields the bound 
20 



where Z ~ ^(0,1), implying a faster rate than in the classical Berry-Esseen estimates. This 
faster rate arises from our use of smooth test functions; a related result is obtained in [15] using 
a coupling approach. 

The next example demonstrates that our techniques allow to easily tackle some problems 
which, due to their non-finite description, have previously escaped the Stein's method treatment. 



Example 3.5 For every r ^ 2, we set 



Xi 



The random variable F^ has the form (|3.29p . with Ui = Ijj^rjV^/*- We have that 



a. 



and also 



E 



i i + 



' ^--^ =ry ' 

i + l i"^ ^ i^ii + 1) ^ i(i + 1 



iSN i^r i^r ^ ' 

It follows from Relation (lOl that, for Z ~ ^(0, 1) 



\E{h{Fr)\-E\h{Z)\\ < 



mm 



\\h"\\ \ 

oo 1 II"' lloo ) 



+ 



20||/i^^||oo 
3(r - 1) 



Law 



In particular, F^ — > Z as r — > cxo 
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Remark 3.6 (A Mehler-type representation) Take an independent copy of X, noted X* = 
{XJ^, ^2 , ...}, fix t > 0, and consider the sequence X^ = {X\,X\^ defined as follows: X* is 
a sequence of i.i.d. random variables such that, for every A; ^ 1, = X^ with probability e~* 
and X\ = with probability 1 — e~* (we stress that the choice between X and X* is made 
separately for every fc, so that one can have for instance X\ = Xi, X\ = Xg, ... and so on). Then 
X* is a Rademacher sequence and one has the following representation: for every F G L?'{a{X)) 
and every lo = (wi, u;2, .-.) G = { — 1, +1}^ 

PtF{io)=E[F{X')\X = Lol (3.31) 

where Pt is the Ornstein-Uhlenbeck semigroup given in (I2.2ip . To prove such a representation of 
Pt, just consider a random variable of the type Xj-^ x • • • x Xj^, and then use a density argument. 
Now observe that, for a centered F = Yln>i ■^nifn) G ^oi^^i^})^ Equation ()2.20|) holds, and 
consequently 

-DkL-^F = V Jn-i{fnik, •)) = / e-'PtDkFdt = / E[DkF{X')\X]dt, 
SO that 



{DF, -DL~^F)f2^^^ = e"* V DkFE[DkF{X')\X] dt = / e-\DF, E[DF{X')\X]) 

^0 k>i -^0 



^2(N) 



dt. 



Note that the representation (jS.Sip of the Ornstein-Uhlebeck semigroup is not specific of Rade- 
macher sequences, and could be e.g. extended to the case where X is i.i.d. standard Gaussian 
(see for instance [211 Section 2.2]). This construction would be different from the one leading 
to the usual Mehler formula (see e.g. [26l Formula (1.67)]). See Chatterjee [6l Lemma 1.1] 
and Nourdin and Peccati [221 Remark 3.6] for further connections between Mehler formulae and 
Stein's method in a Gaussian setting. 



3.3 A general exchangeable pair construction 

Remark 13.61 uses a particular example of exchangeable pair of infinite Rademacher sequences. 
We shall now show that the chaos decomposition approach links in very well with the method 
of exchangeable pairs. Let us assume that F = F(Xi, ...,Xrf) is a random variable that depends 
uniquely on the first d coordinates = (Xi, . . . , X^) of the Rademacher sequence X, with finite 
chaotic decomposition of the form (I2.12p . Now assume that E{F) = and EiF"^) = 1 so that 

d d 
^ = J2 J2 n\Uii,...,in)X,,---Xi„ = Y,Jn{fn)- (3.32) 

n=l l^ii<...<in^d n=l 

A natural exchangeable pair construction is as follows. Pick an index / at random, so that 
P{I = i) = ^ for z = 1, . . . , d, independently of Xi,...,Xrf, and if / = i replace Xj by an 
independent copy X* in all sums in the decomposition (|3.32l) which involve Xj. Call the resulting 
expression F' . Also denote the vector of Rademacher variables with the exchanged component 
by X'^. Then {F, F') forms an exchangeable pair. 

In [31] an embedding approach is introduced, which suggests enhancing the pair (F, F') to a 
pair of vectors (W, W) which then ideally satisfy the linearity condition 

E{W' - W| W) = -AW (3.33) 
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with A being a deterministic matrix. In our example, choosing as embedding vector W 
(Ji(/i), . . . , Jdifd)), we check that 

EMfn)-Jnifnm 

1 



d 

i=l l^ii<...<in^d 

Thus, with W = {J[{fi), . . . , J'^^fd)), the condition (|3.33l) is satisfied, with A = {Xij)i^ij^d 
being zero off the diagonal and An,n = ^ for n = 1, . . . , d. Our random variable of interest F is 
a linear combination of the elements in W, and we obtain the simple expression 



E(F' - F\W) = -LF 
d 



1 



5DF. 



(3.34) 



This coupling helps to assess the distance to the normal distribution, as follows. 



Theorem 3.7 Let F G domD be centered and such that E{F'^) = 1. Let h G C^, let Z ~ 
c/K(0, 1), and let {F,F') form the exchangeable pair constructed as above, which satisfies (I3.34p . 
Denote by L' the Ornstein-Uhlenbeck operator for the exchanged Rademacher sequence X'^, and 
denote by {L')~^ its inverse. Then, 



\E[h{F)] - E[h{z)]\ ^ mu 



d 



'Var 



d 



E {{F' -F)x {{L')~^F' - L~^F) | W) 



+-\\h'\\^E [{F' - Ff X \{L'r'F' - L-^F\\ . 



Proof. Let h £ C^, and let g denote the solution of the Stein equation (|2.22p for h. Using 
antisymmetry, we have that for all smooth g, 

= E[{g{F)+giF'))x{iLr'F'-L~^F)] 

= 2E[g{F){{L'r'F' -L~'F)]+E[{g{F')-g{F)) x {{L')-' F' - L-^ F)] 

= lE[Fg{F)] + E [{g{F') - g{F)) x {{L'y'F' - L~'F)] ; (3.35) 
the last equality coming from (|3.34ll as well as the definitions of {L')~^ and L^^. Hence 

E[9'{F)-Fg{F)\ = E g\F) (^1 + ^[iF' - F) x {{LT'F' - L-'F)\W]^ + R, 
where by Taylor expansion 

\R\ ^ ^\\g"\\^E[{F'-Ff X \iLr'F'-L~'F\]. 
From ()3.35p with g{x) = x we obtain 



E[E{{F' - F) X {{L'y^F' - L~^F)\W)] = E[{F' - F) x {{L'Y^ F' - L~^ F)] 
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by our assumption that EF'^ = 1. We now only need to apply the Cauchy-Schwarz inequality 
and use the bounds from Lemma 12.151 to complete the proof. ■ 



Equation (I3.33P is key to answering the question on how to construct an exchangeable pair in 
the case of a random variable which admits a finite Rademacher chaos decomposition. Indeed, 
one can proceed as follows. Firstly, use W = (Ji(/i), . . . , Jdifd)), where the components come 
from the chaos decomposition. Then the results from [34] can be applied to assess the distance of 
W to a normal distribution with the same covariance matrix. Note that due to the orthogonality 
of the integrals, the covariance matrix will be zero off the diagonal. 

Finally, note that formally, using (I3.34p and that LF = lim^^o ^^^t~^ ' 
E{F{X',)-F{X,)\W) = 1 lim - ^^^'^^ = 1 ihn 1[£;(F(X* )|X,) - F(X,)], 

where the notation is the same as in Remark 13.61 In this sense, our exchangeable pair can be 
viewed as the limit as t — > of the construction in Remark 13.61 



3.4 From twice differentiable functions to the Wasserstein distance 

Although many of our results are phrased in terms of smooth test functions, we can also obtain 
results in Wasserstein distance by mimicking the smoothing in [35] used for Kolmogorov distance, 
but now apply it to Lipschitz functions. Recall that the Wasserstein distance between the laws 
of Y and Z is given by 

dw{Y,Z)= sup \E[h{Y)] - E[hiZ)]\ , 
/leLip(i) 

where Lip(l) is the class of all Lipschitz-continuous functions with Lipschitz constant less or 
equal to 1. Rademacher 's Theorem states that a function which is Lipschitz continuous on the 
real line is Lebesgue-almost everywhere differentiable. For any h £ Lip(l) we denote by h' its 
derivative, which exists almost everywhere. 



Corollary 3.8 Let Z ~ =yK(0, 1) and let F G domD be centered. Suppose that (|3.25l) holds for 

■-b 



every function h £ and that 4(Si + B2) ^ 5. Then 



dw{F,Z) ^ ,/2{Bi + B2){5 + E\F\). 
Proof. Let h £ Lip(l) and, for t > 0, define 

POO 

ht{x) = / h{Viy + Vl - tx)4){y)dy, 



where cf) denotes the standard normal density. Then we may differentiate and integrate by parts, 
using that (l)'{x) = —x4>{x), to get 

\ — t r°° 

h'lix) = yh'iViy + VT^x)(l){y)dy. 



Hence for < f < 1 we may bound 

llM'lloo ~7r"^'"°^y \y\Hy)dy ^ —q- (3.36) 
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Taylor expansion gives that for < t ^ i so that -v/t ^ \/l — t, 



\E[h{F)]-E[htiF)]\ ^ 



E J ^h{Viy + VT^tF)-h{VT^tF)}(Piy)dy 



+E\h{Vl^F) - h{F)\ 
^ Wh'WooVi j W{y)dy + \\h'\\^^^-^=E\F\ ^ ^/t |l + \e\F\^ 



Here we used that ||/i'||oo ^ 1 and that for < 6* < 1, we have (Vl - Ot)~^ < (0"^)"^. 
Similarly, \Eh{Z) — Eht{Z)\ ^ Using (|3.25p with (|3.36p together with the triangle inequality 
we have for all h G Lip(l) 

\E[h{F)] - E\h{Z)]\ ^ l.^[B,{l) + B2{l)) + \Vt{h + E\F\). 

Minimising in t gives that the optimal is achieved for t = 2(i?i(l) + i?2(l))/(5 + -E'|F|), and using 
this t yields the assertion. ■ 

To illustrate the result, for Example 13.41 we obtain dw{F,Z) ^ (for n ^ 6) which is of 
the expected Berry-Esseen order. 



4 Normal approximation on a fixed chaos 
4.1 Explicit upper bounds and CLTs 

We now focus on random variables of the type F = Jg{f), where Jg{f) is the multiple integral 
defined in Formula (|2.6p . q ^ 2 and / G ^q(N)°''. Due to the definition of the derivative operator 
D, as given in Section [2.51 we know that F S domD. Moreover, by combining Theorem 13.11 with 
formulae ([1071) - ([i05D one infers that, for every h e and for Z ~ ^{0, 1), 



\E[h{F)]-E[h{Z)]\ 
^ min(4||/i||oo, \\h"\\ 

c 



l--\\DF\\% 



20 

+ —\\h"\\^E\\DF\\%^^y (4.37) 



The following statement provides an explicit bound of the two expectations appearing in 

Km . 

Theorem 4.1 Fix q^2, let f e ^^(N)"*?, and set F = Jg{f). Then 



E< 1- -\\DF\ 



9-1 



+ (p-1)! 
p=l { 



^2(N) 

q-1 
p-l 



1-q\ 



2 



{2q-2p)\\\f4fxl^. 



2(q-p) \\e2rm«,2(g-p) 



1-fl! 



2 

£2(N)®'J 



9— 1 C 2 



(4.38) 

||2 

2{q-p) ll^2(f^)®2(g-p) 5 

(4.39) 
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and 



q ( 2 

E\\DF\\%^^^,^ ^ I (2<7-2p)!||/^-Vll,'2(N)«,2(,-,m). (4.40) 

Remark 4.2 According to Lemma [2^ one has that 

11/ A /ll^2(N)»(29-i) = 11/ f\\%{n) ^ \\f\\%{n)m x max ^ bi, bg-i), 

and also, by combining this relation with (|2.4p . for p = 2, n — 1, 

ll/*p /ll^2(p^)(8(2(q-p)+i) ^ ||/||£2(pj)®9 /max ^ p{i,hi, ...,bq^i). 

These estimates imply that, once ||/||£2(pj)®9 is given, the bound on the RHS of (I4.40p can be 
assessed by means of a uniform control on the "influence indices" /^(j, 6i, 

j ^ 1. As shown in [2T1 Theorem 2.1 and Section 3], when F = Jq{f) depends uniquely on a 
finite number of components of X, these influence indices can be used to directly measure the 
distance between the law of F and the law of the random variable, say Fq-, obtained by replacing 
the Rademacher sequence with a i.i.d. Gaussian one. This result roughly implies that the two 
components of the bound (|4.37|) have a different nature, namely: (I4.40p controls the distance 
between F and its Gaussian-based counterpart Fq, whereas (|4.38p - (l4.39p assess the distance 
between Fq and a standard Gaussian random variable. 

Proof of Theorem \4-l\ Observe that, by a classical approximation argument and by virtue of 
Lemma [2?6l it is sufficient to consider kernels / with support in a set of the type {1, A^}'', 
where N < oo (this is for convenience only, since it allows to freely apply some Fubini arguments). 
Since DjF = qJq~i{f{j, •)), one has that (due to the multiplication formula (12. lip ) 

(DjFf = g^|;r!(^;^)'j2(,_i_,)(/(jVK7(j,-)xlA,(,_,_,,) 

r=0 ^ ^ 

= q't.iP-MlZ^Sj^i'^^p) (/(J»^/(J» X lA,(,_,,) . (4.41) 
p=i \^ / \ / 

It follows that, by a Fubini argument, 

= 9!||/II?2(N)«, +'7J](P-1)!(^~ ) ^2(,-p)(/*p/x1a2(,_,)), 
p=i ^P ^ 

and the equality (I4.38P is obtained by means of orthogonality and isometric properties of multiple 
integrals. Inequality (|4.39|) is a consequence of the fact that, for any kernel h, \\h\\ ^ \\h\\. To 
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prove (|4.40p . use again (I4.4ip in order to write 

q 



E[{DjF)^] - "4 



/(i,-)^^:;/(i,-)xiA,(,_, 



H ( 2 ^ 



The conclusion is obtained by using the identity 

2 



^2(N)®2(9-p) 



^2(N)8'2(q-p) 



I f f 11^ 

\J > ^ l|^2(m®(2(g-p) + l) • 



52(g-p) 



We deduce the following result, yielding sufficient conditions for CLTs on a fixed chaos. 

Proposition 4.3 Fix q ^ 2, and let F^ = Jq{fk), k ^ 1, be a sequence of multiple integrals such 
that fk G eliny'i and E{F^) = g!||/fc||£2(N)®, ^ 1. Then, a sufficient condition in order to have 
that 



Fk^-^ ^(0, 1) 



(4.42) 



is that 

||/fc*r /fc||^2(N)®2(g-r) ^ 0, Vr = 1 , . . . , ^ - 1 . (4.43) 
Proof. One only needs to combine Theorem 14.11 and Relation (|4.37p with Point 3 of Lemma I2.4[ 



Remark 4.4 The content of Proposition 14.31 echoes the results proved in [27], where it is shown 
that, on a fixed Gaussian chaos of order q, the convergence to zero of the contractions fk\\, 
r = l,...,q — 1, is necessary and sufficient in order to have the CLT (I4.42p . See also [22] for 
bounds, involving norms of contractions, on the normal approximation of the elements of a 
Gaussian chaos. See [28] and [29] for analogous results concerning the normal approximation of 
regular functional of a Poisson measure. Finally, observe that we do not know at the present 
time wether the condition (|4.43p is necessary in order to have the CLT (|4.42l) . 

4.2 More on finite quadratic forms 

When specialized to the case of normalized double integrals, Theorem 13.11 Proposition 14.31 and 
Remark 12.51 yield the following result. 



Corollary 4.5 Let F^ = J2{fk), k ^ 1, he a sequence of double integrals such that E{F^) 

(4.44) 



2||/fc||^2(isj)®2 = 1 (for simplicity) . Let Z ~ o/K(0, 1). Then, for every k ^ I and every h ^ it 



holds that 

\E[h{Fk)] - E{hiZ)]\ 

^ 4\/2min(4||/i||oo, ||/i"||oo) x || A /fc x ||^2(n)®2 + 160||/i" ||oo x +2 /fcll^2(M) 

^ 4\/2min(4||/i||oo, ||/i"||oo) x \\fk*\fk x 1a2||^2{n)»2 



+ 160||/i"||oo X \\fk-k\ fk X 1a||||2(n)®2, 



and a sufficient condition in order to have the CLT (231^ ^^"^ ll/fc *i fi 



/"fc|k2(N)®2 



0. 
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Now consider kernels fk G £o(^)° : k ^ 1, such that for every k the following holds: (i) 
2||/fc||£2(r^)®2 = 1, and (ii) the support of fk is contained in the set {1, Then, the random 

variables 

Fk = J2{fk)= Yl fk(.i,j)X^Xj, k^l, (4.45) 

are quadratic functional (with no diagonal terms) of the vectors {Xi, Xk). Limit theorems 
involving sequences such as (I4.45p . for general vectors of i.i.d. random variables, have been 
intensively studied in the probabilistic literature - see e.g. [9], [H] and the references therein. 
The following result, providing a complete characterization of CLTs for sequences such as (I4.45p . 
can be deduced from the main findings contained in [9] (but see also [lO] for extensions to general 
multilinear functionals) . 

Proposition 4.6 Let X be the Rademacher sequence considered in this paper, and let Fk he 
given by ^J^JB). For every k ^ 1, write [fk] for the k x k square symmetric matrix {fkihj) '■ 1 ^ 
hj ^ k}. Then, the following three conditions are equivalent as k ^ co: 

(a) F,,^A"Z~^(0,1); 

(b) Trace{[/dn = fk\\%^n)^2 - 0; 

(c) E{Fi) ^ EiZ^) = 3. 

Remark 4.7 Condition (b) in the previous statement is often replaced by the following: 

k 

(b') Trace{[/fe]4} ^ and max V/fc(i,jf ^ 0. 

However, Relation (|2.5p shows that the second requirement in (b') is indeed redundant. 
In [5], Chatterjee proves that, in Wasserstein distance. 



k 



dw{Fk,Z) ^ JlTrace{[A]4} + [^/fc(i,j)]^. (4.46) 

i=i i=i 

Note that 

ma^[5^/,(z,j)2]t ^ J][J]A(z,j)2]i ^ max[5]A(z,jf]^ (4.47) 

j^K — ^ — ' ^ — ' Z J^k ^ — ' 

i=l j=l i=l 1=1 

and due e.g. to (|2.5p . Relation (|4.46p gives an alternate proof of the implication (b) — > (a) in 
Proposition 14. 6[ Another proof of the same implication can be deduced from the following result, 
which is a direct consequence of Corollary 14.51 and Lemma [231 (Point 3). 



Corollary 4.8 Let the notation and assumptions of this section prevail. Then, for every k and 
for every h ^ C^, one has that 

\E[h{Fk)]-E[h{Z)]\ 

^""^ ^ ' ■ "* i2 ll^2(p^)®2 + iDU||/^ llooll/fc *i /A: X -'-A=||^2(p^)®2 



^4y2min(4||/i||oo,||/i"||oo)||/fc*i/fc x IA2 ll^2(N)®2 + 160||/i"||oo||/fc A x Ias"^ 



^ 4^min(4||/i|U, \\h"\\^)^Trs.ce{[fkf} + 160||/i"||ooTrace{[/fc]^}. 
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5 Sums of single and double integrals and applications to weighted 
runs 



5.1 Motivation: infinite weighted runs 

Due to their simple dependence structure, runs lend themselves as good test examples for normal 
approximations; for applications see for example [T]. The first Berry-Esseen bound for overlap- 
ping success runs was derived in [12]. A multivariate normal approximation for the count runs 
of finite length in the context of Stein's method can be found for example in [34]. Weighted runs 
can be viewed as a special case of weighted U-statistics; a normal approximation using Stein's 
method is available for example in [36] • Typically these studies focus on counting runs in a finite 
window of a possibly infinite sequence, see for example [l] for details. In contrast, our method 
allows to consider infinite weighted sums of runs. 

Let ^ = : n G Z} be a standard Bernoulli sequence. This means that ^ is a collection 
of i.i.d. random variables such that P[^i = 1] = P[S,i = 0] = 1/2. In the sequel, we will 
suppose without loss of generality that ^ = ^(1 + X) where X = {X^ : n S Z} is a two-sided 
Rademacher sequence, obtained e.g. by juxtaposing two independent Rademacher sequences 
of the kind introduced in Section 12.11 Note that we chose Z as a parameter set, in order to 
simplify the forthcoming discussion. Also, in what follows we will implicitly use the fact that 
all the results and bounds established in the previous sections extend to this setting, by simply 
replacing sums over N with sums over Z (the proof of this elementary fact is left to the reader). 
Now fix an integer d ^ 1 as well as a sequence : n ^ 1} of elements of £^(Z). In this 

section, we study the normal approximation of the sequence {G„ : n ^ 1} defined by 

Gn = Y.a'['k^■■■^i+d. (5.48) 

We call Gn an InEnite weighted d-run. Actually, we will rather focus on the normalized version 
of Gn, namely 

Observe that, in order to apply Theorem 13.11 one needs to find the chaotic expansion of F„ or, 
equivalently, that of G„. Using the identity ^ = ^(1 -|- X), it is immediately deduced, see also 
Section [231-(ii), that 

Gn = 2-('^+^) J^aJ")(l + X)...(l + X,+,) 

= 2-('^+i)^a(") (with/={n,...,i|,|}) 

ieZ I(l{i,...,i+d} 
r=0 jGZ IC{i,...,i+d} 



d+1 

E{Gn) + Jr 
r=l 



\I\=r 

( \ 

1 ieZ I(Z{i,...,i+d} I 

\ \I\=r I 



(5.50) 
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where the tilde indicates a symmetrization. In particular, it is now immediate to compute DGn 
and DL~^Gn, by using the definitions given in Section [231 However, since the analysis of (|5.48|) 
is meant to be only an illustration, from now on we will focus on the case where d = 1 (2-runs). 
The general case could be handled in a similar way (at the cost of a quite cumbersome notation). 



5.2 Normal approximation of sums of single and double integrals 

We will deduce a bound for the quantity \E[h{Fn)] — E[h{Z)]\ (where Z ~ ^{0, 1) and F„ is 
given by (|5.49p ) from the following result, which can be seen as a particular case of Theorem 13. 11 



Proposition 5.1 (Sum of a single and a double integral) Let F = Ji{f) + ^2(5) with f S ^^(Z) 
and g G ^q(Z)°^. Assume that X^jg^ |5((i,/c)| < 00 for all A; G Z. Also, suppose (for simplicity) 
that VarF = 1, and let h ^ Cf. Then 

\E[h{F)] - E[h{Z)]\ ^ min(4||/i||oo, \\h" \\^){2^\\g ^\ gl^,\\,2^^)^2 + 3||/*1 ^ll^^cz)) 



^^60 II/," II 



f\k) + lQ[Y,\9{iM 



(5.51) 



k£2 

Remark 5.2 By applying successively Fubini and Cauchy-Schwarz theorems, one has that 

\\f*i9\\%(z) = -/"(^ aAaiiJ) ^ ll/ll^2(z)IIS'*i5ll£2(z)®2. (5.52) 

This inequality has an interesting consequence. Suppose indeed that the sequences Ji(/n) and 
J2{gn), n ^ I, are such that, as n ^ 00: (a) ||/n||£2(z) 1, (b) 2||5„||^2(2)®2 ^ 1, (c) 

Efcgz/n( fe) 0, (d) Efcez (Eiezl5'n(i,^)|)'' ^ 0, and (e) \\gn A 9n\\e^z)'»2 ^ 0. Then the 
estimate (|5.52|) and Proposition 15. II imply that, for every {a, (3) / (0,0), the sequence 

aJlifn) + PJ2(,gn) . 1 

, , n ^ 1, 

converges in law to Z ~ ^/K (0, 1), and therefore that the vectors {Ji{fn), J2{9n)), n ^ 1, jointly 
converge in law towards a two-dimensional centered i.i.d. Gaussian vector with unit variances. 
Note that each one of conditions (a)-(e) involves separately one of the two kernels /„ and gn- See 
[24] and [28j Section 6], respectively, for several related results in a Gaussian and in a Poisson 
framework. 

Proof of Proposition [5Jl Firstly, observe that D^F = 2Ji(^g{-, k)) + f{k) = 2Y^^^j^g{i,k)Xi + 
f{k) so that, using (a + hf ^ 8(a^ + h^), 

E\DkF\'^ s^l2SEi^g{i,k)xA + 8/^(A:) ^ 128 [ J] |5(i, A;)| j +Sf^{k)< 00. 

We are thus left to bound Bi and B2 in Theorem 13.11 taking into account the particular form 
of F. We have -L'^F = \ J2{g) + Ji{f) so that -DkL'^F = Ji{g{-, k)) + f{k). Consequently, 
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with the multiphcation formula (12. lip , we get 

= 2 ^ Ji [g{., k)f + ?.Y, fik)Ji {gi; k)) + \\f\\%^^^ 
= 2^J2 (<?(-, k) g{; fc)lA,) + 3 ^ f{k)Ji {g{; k)) + 



+ 2||5ll 



2J2(9 A glA,) + 3Ji{f4 g) + 



£2(Z) 



+ 2NI 



so that 



-VarFp = E \2J2{g A g^A,) + SJi{f 4 g)f 

= 8\\g4g^A2\\i2^z)m+9\\f-k\g\\%(^^y 



Hence, 



Bl ^ y8|b *l5lA2 11^2(2)82 +9||/*i 511^2(2) 

{2V2\\g 4 glA, ||^2(^)»2 + 3||/ 4 g\\,2^^^) . 
Now, let us consider B2. We have 

|A.FNl/W| + 2j]|<7(i,A.)|. 



Similarly, 



\DkL-'F\ = \f{k) + Ji{g{;k))\ 



fik)+^g{i,k)Xi 



\f{k)\+^\g{i,k)\^\f{k)\+2Y,\g{^,k)\. 



Still usine; (a + b)"^ ^ 8(a* + 6*), we deduce 



Y,\DkL-'F\x\D,Ff ^ ("1/(^)1+ 2^ 



fcez 



Hence 



and the desired conclusion follows by applying Theorem I3.ll 
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5.3 Bounds for infinite 2-runs 

When d = 1, Proposition 15.11 allows to deduce the following bound for the normal approximation 
of the random variable F„ defined in ()5.49p . 



Proposition 5.3 Let {F„ : ?i ^ 1} be the sequence defined by Fn = " - with 



Here, ^ = {£,n n £ stands for the standard Bernoulli sequence and {a(") : n ^ 1} is a given 

"2 

--b ■ 



sequence of elements of l'^{'L). Consider a function h £ C'^. Then, for Z ~ ^(0, 1) 



\EIHF)]-EIHZ)]\ < 7 ^ mm(4|M 



jG2 



16 VarG„ 

VarG. = A j:(arV + ^ E "^^^S" (5-54) 
follows that a sufficient condition to have Fn Z is that 



^(a5"V = o ((VarG„)2) asn^oo. 



Proof. Identity (|5.54p is easily verified. On the other hand, by (|5.50ll . we have 



Fn = %^^ = Uf) + J2{9\ 

VVarGn 



with 



^ = R ^"""^(^W ® Ha+l} + l{a+l} ® Ma})- 

Now, let us compute each quantity appearing in the RHS of (I5.5ip . Hi ^ j then 

{9A9){hj) = ^^T^ E «i"^«i"^(l{a}Wl{a+l}(A:) + l{a+l}(^)l{a}W 

" a,b,k&Z ^ 



64VarG, 



X lw(j)l{6+i}(^i + l{b+i}{j)l{b}{k) 



(n) (n) ^ . (n) (n) ^ 
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Hence 



64VarG„ 
64VarG„ 

We have 



lb*i s-Ia; 



32VarG„ 



a,b,k(^Z 



32VarG„ 

Hence, using (a + 6 + c + (i)^ ^ A{a^ + b"^ + + cP), 



32VarG„ 



16VarG„ 

V2 
16VarG„ ' 



('")\4 



We have, using (a + 6) ^ 8(a + 6 ), 



J]ai")(l|„|(A:) + l|,+i}(fe)) 



256(VarG: 



^ 16(VarG„)2 ^ ^"^^ ^ ' 
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Finally, still using (a + bp < 8(0"^ + 

4 



E 



4096(VarG„)2 



E 

fcez 



XI l«a"^l|lw(^)l{a+l}(^) + l{a+l}Wl{a}(fc)| 



4096(VarG„ 
1 

256(VarG„)- 



feez 



fcez 



{'«)n4 



Now, the desired conclusion follows by plugging all these estimates in (|5.51|) . after observing that 
Eiez(«l"'')^("i+i)^ ^ Eiez("i"^)'^' by the Cauchy-Schwarz inequality. ■ 



Example 5.4 1. Choose a^""^ = l{i Then X]iez("i"'')^ = ''^ ^iid 

so that (I5.53P gives a bound of order n~^/^ overall. This is the same order as obtained in 
|34| . also for smooth test functions, while [36] obtain a bound of this order in Kolmogorov 
distance under suitable conditions on the weights. 



(")\4 



2. Choose ) = i"! l^n,n+i,...}{i)- Then = 0{n-^) and 



(")\4 



SO that (|5.53|) also gives a bound of order n overall. 



6 Multiple integrals over sparse sets 
6.1 General results 

Fix d ^ 2. Let F/v, ^ 1, be a sequence of subsets of N'' such that the following three properties 
are satisfied for every N ^ 1: (i) F/v 7^ 0, (ii) F]\f C (as defined in (|2.2p ). that is, Fn is 
contained in {!,... ,N}'^ and has no diagonal components, and (iii) F/v is a symmetric set, in 
the sense that every G F/v is such that S F/v for every permutation a 

of the set {1, d}. Let X be the infinite Rademacher sequence considered in this paper. Given 
sets F]\f as at points (i)-(iii), we shall consider the sequence of multilinear forms 

Sn = [dl X \FM]-^ E • ■ • ^^rf = '^'i^M^ ^ ^ 1' (6-^^) 

(ii,...,id)eFjv 
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where |F/v| stands for the cardinahty of F^r, and 



f Nik,-, id) ■■= [dl X l-^A^I] ^ X lFiv(il,---,«d)- 

Note that E{Sn) = and E^Sj^) = 1 for every N. In the paper [3], Blei and Janson studied the 
problem of finding conditions on the set -F/v, in order to have that the CLT 

Sn^-^ Z J^{0,1), N^oo, (6.56) 

holds. 

Remark 6.1 Strictly speaking, Blei and Janson use the notation Fjy in order to indicate the 
restriction to the simplex {(ii,...,?^) : «i < «2 < ■•• < id} of a set verifying Properties (i)-(iii) 
above. 

In order to state Blei and Janson's result, we need to introduce some more notation. 

Remark on notation. In what follows, we will write to indicate vectors = (ai,...,afc) 
belonging to a set of the type {!,..., A^}'^ =: [N]^, for some k,N ^ 1. We will regard these 
objects both as vectors and sets, for instance: an expression of the type a^ n i/ = 0, means that 
the two sets {ai,...,Ofc} and have no elements in common; when writing j G a.}., we 

mean that j = ar for some r = 1, k; when writing a^ C i/ {k ^ I), we indicate that, for every 
r = 1, k, one has = is for some s = 1, I. When a vector a^ enters in a sum, we will avoid 
to specify a^ € [N]^, whenever the domain of summation [A^]*^ is clear from the context. 

Given N 1 and an index j £ [N] , we set 

F;fj = {id e Fjv : j G id}- 

For every A^, the set C F/v x F/v is defined as the collection of all pairs (i^, k^) G F/v x F/v 
such that: (a) n k^ = 0, and (b) there exists p = 1, d — 1, as well as C and k^ C k^ 
such that 

((kp, id \ ip), (ip, kd \ kp)) G F/v x F/v, 

where \ ip represents the element of [A^]"'"^ obtained by eliminating from i^ the coordinates 
belonging to ip, and (kp, id \ ip) is the element of [A^]*^ obtained by replacing ip with kp in i^ (an 
analogous description holds for (ip, k^; \ kp)). In other words, the 2d indices ii, . . . ,id,ji, ■ ■ ■ ,jd 
can be partitioned in at least two ways into elements of F/v. 

Theorem 6.2 ([H Th. 1.7]) Let the above notation and assumptions prevail, and suppose that 
I ^* ' I 

lim max , ^ = 0, and (6.57) 

N^oo j^N I F/v I 
\F*\ 

lim = 0. (6.58) 

JV^oo |F7v|2 ^ ^ 

Then Relation i6.56]} holds, with convergence of all moments. 

Remark 6.3 As pointed out in [4], Condition (|6.58l) can be described as a weak "sparseness 
condition" (see e.g. [3]). See also [H Th. 1.7] for a converse statement. 
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The principal achievement of this section is the following refinement of Theorem 16. 21 

Theorem 6.4 Under the above notation and assumptions, consider a function h ^ C^. Then 
there exist two universal constants Ci and C2, depending only on d, \\h\\oo and \\h"\\^, such that, 
for every N ^ \ and for Z ~ ^(0, 1), 



\E[h{Z)]-E[h{SN)]\ ^Ci 



\Fn\ 



+ C2 



N,j\ 
l^^N I Fat I 



max 



(6.59) 



Proof. By combining (I4.37P with Theorem 14. 11 we know that there exist universal combinatorial 
constants Up > 0, p = 1, d — 1 and Pf > 0, I = 1, d, such that 



d-i 



\E[hiSN)]-E[h{Z)]\ ^ ||/l||oo2]«^||/7V^/ivlA-,_^, 



|£2(p^)®2{d-p) 



p=l 



d-1 



+ \\h"\\ooY^Pf\\fN*\-' I 



N\\l2t^\^2{d-l) + l ■ 



(6.60) 



(6.61) 



1=1 



We now evaluate each norm appearing in (I6.60p - (|6.6ip . According to the second and the third 
points of Lemma [231 and using the fact that ||/Ar||£2(i^)®d = {d\)~^ , one has that 



1 



dx dl i^TV \Fn\ 



max 



(6.62) 



Also, by combining the two inequalities in the second point of Lemma [23 we deduce that, for 
every I = 2, d — 1, 

1 



\\fN*i /7v||£2™^®2{d-^)+ 



dV 



(6.63) 



{d\)y^Vd 



IF; 



N,j\ 



max ■ , 

JiiN I Fat I 



{d\)y^Vd 



max , ^ , 
JiiN \Fn\ 



Now fix p = 1, ...,d — 1. One has that 



Tn 



*?, fN 1a 



(6.64) 



(d!|F^|)^ 



:0} 



7=0 



p 



1) {d\\FN\Y' 



where 
Uo = 



lFjv(ad-p, hp)lF^{y^d-p, bp)lFjv(xd-p, yp)lFjv(ad-p, yp)l{ad_pnxd_p=0}l{bpnyp = 



:0} 



< I 



#1 



AT h 
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and 



^ \Fn\ X max|F;^ - I, 



and finally, for 7=1, — 1, 

= ^ ^ Ifjv (^oi—p) bp— 'y)l_FjY (X|^_p, u^, bp_^) X 

— p, U'y, bp — 

^d — p) yp— 7 

X lFiv(xd-p, U^, yp-^)lFjv (ad-p, U^, yp-7)l{ad-pnxd-p=0} l{bp_^nyp-^=0} 

< X lFiv(ad-p,u^,bp-^)li?j^(xd-p,u^,bp-^) X 

^d— p; yp — f 

xlFiv(xd-p,u^,yp-'y)lF^r(ad-p,u^,yp-7)l{bp-^nyp-^=0} 
The proof is concluded by using the estimates (16.621) (for p = 1) and (I6.63P (for p = 2, d — 1). 



We shall now state a generalization of Theorem 16.41 providing an explicit upper bound for 
the normal approximation of multiple integrals defined over infinite sets. This result shows 
once again that our approach allows to deal directly with the Gaussian approximation of random 
variables that are functions of the whole infinite sequence X. Consider a sequence of real numbers 
P = {Pi : i ^ 1} £ i'^iN), and define the finite measure on N 



(6.65) 



We denote by {d ^ 2) the canonical d-product measure associated with m^. For every d ^ 2 
and for every set F C N"', we define the sets F# C F x F and F* , j = 1, 2, ... as before. In 
particular, F* is the collection of all (ii,...,id) S F such that j = ik for some k. For f3 as 
before, and F S (possibly infinite) such that 'mp{F) > 0, we are interested in the normal 
approximation of the random variable 

J((^^P)= ...... Lii/. yZ Pn■■■(3^,X,,...Xi^. 



The following statement, whose proof (omitted) follows along the lines of that of Theorem 5.4, 
provides an upper bound for the normal approximation of J(/3, F). 

Proposition 6.5 Let Z ~ ^(0, 1). Under the above notation and assumptions, for every h £ 



one has that there exist positive constants Ki and K2, depending uniquely on d, 
such that 



and \\h"\ 



\E[h{Z)]-E[h{J{(3,F))]\ ^Ki- 



m 



(F#) 



mj{F) 



K2 



m${F*) 
sup — -T-, — — 
3^1 miiF) 



(6.66) 
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6.2 Fractional Cartesian products 

In this section, we describe an explicit application of Theorem 16.41 The framework and notation 
are basically the same as those of Example 1.2 in Fix integers d ^ 3 and 2 ^ ?n ^ d — 1, 
and consider a collection of distinct non-empty subsets of [d] = {l,...,d} such that: 

(i) Si / 0, (ii) [JiSi = [d], (iii) \Si\ = m for every i, (iv) each index j £ [d] appears in exactly 
m of the sets Si, and (v) the cover {S*!, 5;^} is connected (i.e., it cannot be partitioned into 
two disjoint partial covers). For every i = I, ...,d and every = {yi, ■■■^Ud) G N"', we use the 
notation vr^^y = {yj : j G Si). Note that the operator vrg. transforms vectors of N"^ into vectors of 
N™. For every N ^ d™, write n = [A^-^/^J , that is, n is the largest integer such that n ^ N^/^ . 
Now select a one-to-one map (p from [n]™' into [N], and define 

F*N = {(v^(7r5,kd),...,(^(7rs,k^)) : G [nY] C [N]\ 
(note that, in general, -F^ is not a symmetric set), F^* = H A^, and also 

Fn = symiP;;), (6.67) 
where sym(F^*) indicates the collections of all vectors = {yi, ■■■,yd) G N'^ such that 

(ya(i),--->y<7{d)) e ^a^* 

for some permutation a. 



ing to h6.61\) and h6. 55]) . Then, for every h £ C"^, there exists a constant K > independent of 



Proposition 6.6 Let Z ~ ^(0, 1), and let F/v and Sn, N ^ d^, be respectively defined accord 

--b 

N , such that 

\E[h{SN)]-E[h{Z)] \ ^ ^ 



iVl/(2m) ' 



Proof. The computations contained in [U p. 16] can be straightforwardly adapted to our setting, 
so that we deduce that the sequence {F/v : ^ d™} has combinatorial dimension a = d/m. 
Recall that this means that there exist finite constants < Q2 < Qi < oo (independent of 
N) such that: (a) |FAr| ^ Q2N°', and (b) for every Ai,...,Ad C [N], \Fn n (^1 x • • -Aa)] ^ 
Qi{maxi^j<^d\Aj\)°' . Thanks to Theorem 16.41 to conclude the proof it is therefore sufficient to 
check that, as — > 00, 

maxlF^^^jl =0(iV"-^) and |F*| = ©(iV^"-!/"^). (6.68) 

Start by observing that every element {zi, Zd) of Fjv has the form 

(zi, Zd) = ((/?(7rs^(j)krf), V9(7rs^(^,krf)), (6.69) 

where k^^ = {ki,...,kd) G [n]'^ and cr is a permutation of [d]. Since 99 is one-to-one, it follows 
that, for every j ^ A^, there are at most did x n'^~"^ elements of the set F^j. To see this, just 
observe that, every {zi, Zd) G Fj\[j is completely specified by the following three elements: (i) 
a permutation a of d, (ii) the index a = 1, ...,d such that Za = j, and (iii) the values of those 
coordinates ki, such that b ^ Sf^(^a)- Since n ^ A^^/™ by construction, one immediately obtains 
the first relation in (16.681). 
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To prove the second part of (|6.68j) . we shall first show that, if {{zi, z^), {z'l, z'^)) G F* are 
such that {zi,...,Zd) is as in (|6.69p, and {z[,...,z'j) = {ip{7rs^,^^^^k'd), .■.,(p{iTs^,^^yd)) i^'d G W*) 
then krf and k'^ must have m coordinates in common. The definition of implies indeed that, 
for such a vector ((zi, ■■■,Zd), {z'l, there exists 

such that, for some p = l,...,d — 1: (a) there exist indices ai,...,ap and bi,...,bp such that 
7rSp{,^)id = 7r5„(^)kd for every i = 1, ...,p, and (b) there exist indices vi, ...,Vd-p and wi, ...,Wd-p 
such that vr5pj^ jirf = Tr^^,^^ ^k'^ for every i = l,...,d — p. Note that then we necessarily have 
{ai, ap,vi, ...,Vd-p} = {bi, ...,bp,wi, ...,Wd-p} = [d]. By connectedness, there exists at least 
one q E [d] such that iq (i.e., the gth coordinate of i^) belongs both to one of the sets irSpf^^.-^'id and 
to one of the sets Tr^^^^^i^^. This last property implies that there exists a constant L, independent 
of N, such that < Ln^'^-^ ^ LiV2a-i/m_ rpj^-g concludes the proof. ■ 

Remark 6.7 Note that the combinatorial dimension a = d/m, as appearing in the proof of 
Proposition 16.61 is an index of interdependence between the coordinates of the sets Fjy. See Ch. 
XIII in [3] for more details on this point. 

6.3 Beyond the Rademacher case: a question by Blei and Janson 

Now we go back to the framework and notation of Section lOl so that, in particular, the sequence 
Sn, N ^ 1, is defined according to (|6.55ll . For every define to be the random variable 
obtained from (|6.55p by replacing the sequence X with a i.i.d. Gaussian sequence G = {Gi : i ^ 
1}, where each Gi has mean zero and unit variance. A natural question, which has been left open 
by Blei and Janson in [U Remark 4.6], is whether under the conditions (I6.57p - (l6.58p the sequence 
S^, ^ 1, converges in law towards a standard Gaussian distribution. Note that this problem 
could be tackled by a direct computation, based for instance on [22] or [27]. However, the results 
of this paper, combined with those of [21], allow to elegantly deduce a more general result, which 
we provide in the forthcoming statement. In what follows, we write V = {Vi : i ^ 1} to indicate 
a centered i.i.d. sequence, with unit variance and such that F|Vip = rj < oo (note that the 
results of [21] would allow to obtain similar results in even more general frameworks, but we do 
not look for generality here). We also denote by the random variable obtained from (|6.55ll 
by replacing X with V . 

Proposition 6.8 Under the above notation and assumptions, consider a function h G C^. Then, 
there exist two universal constants Bi and B2, depending uniquely on d, rj, \\h\\ao, ||^"||oo o,nd 
||^'"||oo, such that, for every N ^ 1 and for Z ~ ^(0, 1), 

(6.70) 

In particular, if Il6.57\) - f6. 58\) take place, then converges in law towards Z . 
Proof. One has that 

\E[h{Z)] - E[h{S]^)]\ ^ \E[h{Z)] - E[h{SN)]\ + \E[h{SN)] - E[h{S]^)]\, 



\E[h{Z)] 



E[h{S}^)]\ 



^ Si 



\Fn\ 



+ B2 



max , ^ , 
j^N \Fn\ 
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and the conclusion is obtained by combining Theorem 16.41 with the fact that, according to [2TJ 
Theorem 3.18, case H2], there exists a constant Q, depending only on ||/i.'"||oo and r/, such that 



\E[h{SN)] - E[h{S]^)]\ ^ Q 



j<7V |Fiv| 



max 
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Appendix: Some technical proofs 
Proof of Lemma 12.41 

In what follows, for /c ^ 1, we shall write to indicate the generic vector = (ai, ...,afc) € N'^. 
We start by proving the first point. To do this, just write, using the Cauchy-Schwarz inequality, 

11/ *r 5'll^2(j(^^®(n+m-r-i) = ^ ^ f *r 9 {^m+n—r-l) 

^m-{'n — r — l 



m—ri '^r—l i ^/ j ) 

i?! — r jm — r ^r — l ^ / 



in — r ^r — l jm — r ^r — l 

The first part of the second point is obtained by writing 

2 / \ 2 



max I X f{j,K^i) ^ E E /'(i'bn-i) 

= \\f<-'f\\' 



^2(N) 

^ max X /2(j,b„_i) X X X /2(j,b„_i) 
= max X /2(j,b„_i) X ||/||^2(N)®n. 

bn-l 

For the second part, we have, by applying (in order) the Cauchy-Schwarz inequality, as well 
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as the previous estimate, 

II/*/ 9ll^2(p^)®(n+m-2i + l) 
j k„_i / J 

) ji— 1 ) Im— i ) 

i k„_j l-m-i ji-l 



' ')\ \ £2 (M)'S{" + m-2l) 



^ max ^ /^(j,b„_i) X ^5f2(i^) = max ^ /^(j,b„_ 



X 




lf2(N)®" 



2 

II „I|2 _ II ^ , n-1 fii II „|| 



E I E /^(J>bn-l) I X lbll£2(N)»m = ||/*;^_i /||£2(M) X II "112 



Finally, for the third part, just observe that 



/||^^2(N)®(2n-i) = E_E _E /(J,in-l)/(j,jn-l) = E I E /(•?■' i'^-l) I = 11/ < ^ /ll^^2(N) 

j iji-ljn-l j \in-l 



and, for 2 ^ / ^ n: 

J-l -fll2 

2 



*l /ll^2(f^)®{2n-2! + l) 



= E E E I E-^'^'^'^'-i''^"-')-^^-^'^'"!'^"-') 

i=j k„_; 

=^ E E-^(^'-i'^"-'+i)-^^^'-i'^"-'+i) 
= ii/*i:}/xiA- 



^2„-2i+2ll£2(N)®(2"-2i+2)- 

Proof of Lemma 12.61 

We have, by bilinearity and using the first point of Lemma [2 



ll/fc K 9k- f *r 5||^2(p^)8(m + n-2r) = \\fk K {Ok - Q) + (fk " /) *r 5 1 1 £2 (m+n-2r) 

^ ll/fc K idk — 5)ll£2(N)®(7,i + n-2r) + ||(/fc " /) *r 1 1 £2 (N)® (m+n-2r) 

^ ll/fc ll£2(N)»"|bfc - g\\l2{N)<Sm + ll/fe - /||£2(N)®"||9||£2(p^)(g.m 0. 



37 



Proof of Proposition 12.91 

We assume, without loss of generality, that m ^ n. We first prove (|2.1ip for functions / and g 
with support, respectively, in and A^, for some finite N. One has that 

Jn if) Jm. (g) = [ {f*'o 9} dfiT^"' = [ {f0g} d^lT^^. 

For r = 0, m, we denote by 11^ (n, m) the set of all partitions vr of {1, n + m} composed of 

i) exactly r blocks of the type {11,12}, with 1 ^ ii ^ n and n + l^-i2^n + m, 

ii) exactly n + m — 2r singletons. 

For instance: an element of II2 (3, 2) is the partition 

^ = {{1,4}, {2, 5}, {3}}; (6.71) 

the only element of Uq (n, m) is the partition vr = {{1} , {2} , {n + m}} composed of all sin- 
gletons; an element of Us (4, 4) is the partition 

7T = {{1, 5} , {2, 6} , {3, 7} , {4} , {8}} . (6.72) 

It is easily seen that Ilr- (n,m) contains exactly (™) elements (to specify an element of 
rij. {n,m), first select r elements of {1, ...,n}, then select r elements in {n + 1, ...,re + m} , then 
build a bijection between the two selected r-sets). For every r = 0, m and every vr E 11^ (n, m), 
we write B^^^ (it) to denote the subset of {1, A^}""'"™' given by 

{(ii, ...,in,in+i, ■■■■lin+m) G {1, A^}""''"* : ij = ik iff j and k are in the same block of vr} 

(note the "if and only if" in the definition). For instance, for vr as in (|6.71l) . an element of (^) 
is (1, 2, 3, 1, 2); for vr as in an element of (vr) is (1, 2, 3, 4, 1, 2, 3, 5) . The following two 

facts can be easily checked; 

A) 

m 

A^xA^^ = (J U <„(vr), 

r=0 7renr(n,m) 

where the unions are disjoint; 
B) For every r = 0, m, and every vr € (n, m). 



n+m—2r 

X 



(note that the last expression does not depend on the partition vr, but only on the class 
(n, m)). 



38 



It follows that 

Jn if) Jra (g) 



-_{*) 



n+m 



E E 

' ' / RiV ( (r)\ 

r=0 Tri-)(ill%{n,m) ""'^y" > 



r=0 

m 

r=0 
m 



n\ m 



■n-\-m — 2r 



n\ m 



^n-\-m — 2r 



3n+m 



r=0 



n\ m 



r I \ r 



+m~2r 



fKa] 1a 



^n-\-m-—2r 



which is the desired conclusion. We stress that the equality (*) has been obtained by using the 
symmetry of the measure /x^"^™. The result for general f\g is deduced by an approximation 
argument and Lemma [2?6l This concludes the proof. ■ 
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